A Linguistic Search Tool for Semitic Languages
Abstract
This paper discusses searching a corpus for linguistic patterns. Semitic languages have complex morphology and ambiguous writing systems. We explore the properties of Semitic languages that challenge linguistic search and describe how we used the Corpus Workbench (CWB) to enable linguistic searches in Hebrew corpora.
Related resources
Bayesian phylogenetic analysis of Semitic languages identifies an Early Bronze Age origin of Semitic in the Near East.
The evolution of languages provides a unique opportunity to study human population history. The origin of Semitic and the nature of dispersals by Semitic-speaking populations are of great importance to our understanding of the ancient history of the Middle East and Horn of Africa. Semitic populations are associated with the oldest written languages and urban civilizations in the region, which g...
Hebrew and North West Semitic: Reflections on the Classification of the Semitic Languages
1. AS IS WELL KNOWN, the comparative method has been elaborated upon with reference to the Indo-European languages. For more than a century, it has been customary to view them from the angle of both the family-tree theory and the wave hypothesis. As far as the continuity of the territory of Indo-European languages can be posited, it is the wave hypothesis that best explains the relation of the l...
PII: S0010-0277(01)00120-2
In a very interesting paper, Boudelaa and Marslen-Wilson (in press) revive a 19th century theory of Semitic language morphology suggesting that the major morphological unit that conveys the core meaning of words (mainly verbs) in Semitic languages is a bi-consonantal structure labeled ‘etymon’ (Gesenius, 1817, edited and enlarged by Kautzsch). From a historical perspective, this idea has been r...
A Comprehensive NLP System for Modern Standard Arabic and Modern Hebrew
This paper presents a comprehensive NLP system by Melingo that has been recently developed for Arabic, based on Morfix, an operational, highly successful, comprehensive Hebrew NLP system developed earlier. The system discussed includes modules for morphological analysis, context-sensitive lemmatization, vocalization, text-to-phoneme conversion, and syntactic-analysis-based prosody (intonation) ...
Learning to Identify Semitic Roots
The standard account of word-formation processes in Semitic languages describes words as combinations of two morphemes: a root and a pattern. The root consists of consonants only, by default three (although longer roots are known), called radicals. The pattern is a combination of vowels and, possibly, consonants too, with ‘slots’ into which the root consonants can be inserted. Words are created...
Publication date: 2010